All the figures and tables contained in this report are available with publication-ready quality in the output directory of the OROCHI user.

For each of the sections shown in this report, these are the data used:

Read-based

*All read-based analysis plots were made using MicrobiotaProcess. For more information, consult their documentation.

Alpha diversity

  1. Observed and Shannon indexes for all samples, regardless of treatment
  2. Boxplot and violin plot of Observed and Shannon indexes per treatment

Taxonomic abundance

Average read counts per treatment plotted per taxonomic level. You can naviagte each level by clicking the buttons underneath.

Remember: You already have all these plots (per taxonomic level), there is no need to download them again. Simply look for them in {OUTDIR}/results/09_plots/1-Reads/

For all the samples, using a heatmap maybe be a better option to plot all this information in a more understandable way, where samples can also be seen. Columns correspond to samples, rows correspond to the labels found in the currently displayed taxonomic level. You can navigate each level by clicking the buttons underneath.

Beta diversity

Bray-Curtis dissimilarity indices are used to calculate pairwise distances between all samples (even the ones that belong to the same treatment), and significance is calculated (Wilcoxon test)

PCoA of all samples. Colors correspond to treatment, and boxplots on the right and top side indicate the distribution of the samples of each treatment along each coordinate. 95% confidence intervals are also plotted.

Clustering

Dendrogram showing grouping of the whole dataset based on hierarchical cluster analysis, colored based on the treatments found in the metadata fed by the user.

Bin-based

Bin quality

Interactive plots showing the completeness and contamination of each bin based on CheckM results. The threshold that defines whether a bin is good quality or not is dependant on the type of sample processed. In general, we suggest to look for a minimum of 70% completeness and a maximum of 10% contamination. You can find more data about your bins in {OUTDIR}/results/06_binning.

A_bins_scatterplot.html


B_bins_scatterplot.html


MAG linkage

(From left to right) Relation between 16S sequences, contigs from each assembly (indicated at the top of the plot), and refined bins from your analysis. Phyloflash and MarkerMAG are used to extract and assemble 16S rRNA gene sequences from your data and link them to the more appropriate contigs and bins. The size of the fragment indicates the strength of the linkage - an approximation derived from the number of reads that matched -, and the color of the The width of the connections corresponds to the strength or number of linkages between elements. Color intensity in the flow indicates different linkage groups within the same round and the different shades of grey (if present) indicate which round of MarkerMAG estimation that linkage comes from. You can find more about this process in MarkerMAG Official GitHub page.